[API] Add DeepOCR pipeline API provider by leejooan · Pull Request #1473 · open-compass/VLMEvalKit

leejooan · 2026-03-04T07:38:30Z

Summary

Add support for DeepOCR pipeline so VLMEvalKit can run evaluations
using DeepOCR's document processing pipeline via an OpenAI-compatible
chat completions API.

The DeepOCR pipeline combines deep document OCR with a large
vision-language model, making it especially strong on document
understanding and text recognition tasks.

OCRBench result: 91.7 / 100

Changes

vlmeval/api/deepocr_api.py: New DeepOCRAPI class.
Uses DEEPOCR_API_BASE with Bearer DEEPOCR_API_KEY.
vlmeval/api/__init__.py: Export DeepOCRAPI.
vlmeval/config.py: New model entry:
- DEEPOCR

Adds `DeepOCRAPI`, an OpenAI-compatible wrapper for the DeepOCR pipeline. Credentials are configured via environment variables `DEEPOCR_API_BASE` and `DEEPOCR_API_KEY`. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

leejooan · 2026-03-04T08:12:26Z

Hi,

I’ve emailed the OpenCompass team (opencompass@pjlab.org.cn) with the environment
variables required to run this provider.

If you need any additional setup or have questions, please feel free to reach out
at lja@koreadeep.com.

We look forward to your review.

mzr1996 · 2026-03-04T08:18:00Z

vlmeval/api/deepocr_api.py

+
+    def __init__(
+        self,
+        model: str = "gpt-4-1106-vision-preview",


The model name is 'gpt-4-1106-vision-preview'? better to use your own name, becasue it's related to the result file name.

The model name is 'gpt-4-1106-vision-preview'? better to use your own name, becasue it's related to the result file name.

Thanks for the feedback! I've updated the default model name from
"gpt-4-1106-vision-preview" to "deepocr" in the latest commit.

The previous name was only a placeholder to indicate OpenAI-compatible
format support. Using "deepocr" is more appropriate as the actual
model identifier and will align with the generated result/output names.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

[API] Add DeepOCR pipeline API provider

f9346ab

Adds `DeepOCRAPI`, an OpenAI-compatible wrapper for the DeepOCR pipeline. Credentials are configured via environment variables `DEEPOCR_API_BASE` and `DEEPOCR_API_KEY`. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

mzr1996 reviewed Mar 4, 2026

View reviewed changes

[API] Fix default model name for DeepOCRAPI

55d2ab0

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[API] Add DeepOCR pipeline API provider#1473

[API] Add DeepOCR pipeline API provider#1473
leejooan wants to merge 2 commits intoopen-compass:mainfrom
leejooan:feat/DEEPOCR-api-provider

leejooan commented Mar 4, 2026

Uh oh!

leejooan commented Mar 4, 2026

Uh oh!

mzr1996 Mar 4, 2026

Uh oh!

leejooan Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

leejooan commented Mar 4, 2026

Summary

Changes

Uh oh!

leejooan commented Mar 4, 2026

Uh oh!

mzr1996 Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

leejooan Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants